Picture for Siyuan Zhang

Siyuan Zhang

Reasoning as State Transition: A Representational Analysis of Reasoning Evolution in Large Language Models

Add code
Jan 31, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Add code
May 22, 2025
Viaarxiv icon

From Structural Design to Dynamics Modeling: Control-Oriented Development of a 3-RRR Parallel Ankle Rehabilitation Robot

Add code
May 19, 2025
Viaarxiv icon

Log Optimization Simplification Method for Predicting Remaining Time

Add code
Mar 10, 2025
Viaarxiv icon

Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement

Add code
Feb 26, 2025
Viaarxiv icon

STAIR: Improving Safety Alignment with Introspective Reasoning

Add code
Feb 04, 2025
Figure 1 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 2 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 3 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 4 for STAIR: Improving Safety Alignment with Introspective Reasoning
Viaarxiv icon

Parametric $ρ$-Norm Scaling Calibration

Add code
Dec 19, 2024
Viaarxiv icon

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

Add code
Apr 29, 2024
Figure 1 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 2 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 3 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 4 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Viaarxiv icon

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Add code
Apr 17, 2024
Figure 1 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 2 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 3 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 4 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Viaarxiv icon